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Abstract: We introduce new test statistic to test the independence of two multi-dimensional random 
variables. Based on the Li-distance and the historgram density estimation method, the test is compared 
" via Bahadur relative efficiency to several tests available in the literature. It arises that our test reaches 
better performances than a number of usual tests among whom we cite the Kolmogorov-Smirnov test. 
Beforehand, large deviation result is stated for the associated statistic. The local asymptotic optimality 
relative to the test is also studied. 
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C*** \ 1 Introduction 

Consider a sample of W 1 x M, d -valued random vectors (X\, Yi), . . . , (X n ,Y n ) with independent and iden- 
tically distributed (i.i.d.) pairs defined on the same probability space. The density function of (X,Y) is 
denoted by /, while f\ and f 2 stand for the density function of X and Y with respect to the Lebesgue 
measure Ai and A2, respectively. In this paper, we are concerned with the problem of independence 
testing and the comparison of performances of various tests. Here, we propose to test the hypothesis 
that X and Y are independent on the basis of the shape of its density from the L\ point of view. More 
precisely, we consider the test of the following null hypothesis 



H : \f(x,y)- f 1 (x)f 2 (y)\dxdy = 0. 



> 
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For this purpose, we consider the histogram density estimates of the density of (X, Y), X and Y. In 
order to define its, let P n = {A n j,j > 1} and Q n = {B n j,j > 1} be a cubic partitions of M. d and 

M. d respectively, and denote by v n , [i n ^\ and [i n ^i the empirical measures associated with the samples 
(X\,Yi), . . . , {X n , Y n ), Xi,..., X n , and Yi, . . . , Y n , respectively. The histogram density estimators of / 
and fi are respectively defined by 

X " 1 

b ■ fn(x, y) = w A wnenever (X, V) G An. 3 X B nj 



and 



Mn,l(-4n,i) 



fn,i(x) = 1 A n '\ whenever x € A n j, 



with f n2 is defined similarly as f n %. To perform the test of the hypothesis H, we consider the following 
statistic 



V n = j I \fn(x,y) - f n ,i{x)f n>2 (y)\dxdy. 

The work performed here deals with the Li-distance large deviation result for statistic V n and the 
comparison of their performances in the Bahadur efficiency sense with some other tests available in the 
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literature. Furthermore, the local asymptotic optimality relative to the test is studied. The independence 
hypothesis may be expressed in various ways. Either the case pertaining with the density and the 
Li-distance in study in this paper, the independence of X and Y, based on the equality of the joint 
distribution function and the product of its marginals, may be defined by the following hypothesis 

Hi : F(x,y) = F 1 (x)F 2 (y), for all (x,y) £l d x R d ' , 

where F is the distribution function of (X,Y), and F%, F 2 are the marginal distribution functions. A 
number of statistics have been proposed to test the hypothesis H\ or any other hypothesis form against the 
general alternative that the equality in H\ is violated at least one point or narrower classes of alternatives. 
Among the most studied ones in the univariate case, we quote the Kolmogorov-type statistic has been 
introduced by [T]. It is defined by 

T n = sup | F n (x,y) - F ni i(x)F n> 2(y) |, 

— oo<x,y<oo 

where F n , F nt i, and F n2 the empirical distribution function, constructed from the initial sample and its 
components. Properties of T n and its various versions have been investigated in several papers among 
whom we quote [1] and [11]. On the basis of the empirical distribution function several other tests have 
been proposed for testing H. The well-known statistics are 

= / J[Fn(x,y)-F n7l (x)F n>2 (y)} k 

x qi (F nA (x))q 2 (F ni2 (y))dF nil (x)dF n:2 (y), (1) 

where /c is a positive integer, and q\ and q 2 are nonnegative weight functions on (0,1). The statistic 
B^ qim for q\(u) = q 2 {u) = sin(-7ru) has been introduced by Koziol and Nemec (1979). For k = 2 and 
Ql = Q2 = 1) a statistic equivalent to (P) was proposed by [8], with its properties later being studied by 
PP, [2] and [10], and by [3] for weights q\ and q 2 not equal to 1. Similar to the Durbin-type statistic in 
the goodness-of-fit testing, [5] proposed the statistic 



M n = sup 

— oo<i<oo 



(F n (x,y) - F n>1 (x)F nj2 (y)) dF n , 2 (y) 



The author also investigated large deviations of this statistic and their various versions under the inde- 
pendence hypothesis. As to linear rank statistics for testing H\, they usually are written in the form 



L n = n 

i=l 



1 a nA (Ri/(n + l)K, 2 (Si/(n + 1)), 



where Ri is the rank of X{ among X±, . . . , X n , Si is the rank of Y{ among Y±, . . . , Y n , and a ni i, a„ i2 are 
some real functions on [0, 1]. As an usual test very close to the linear rank test, we cite the Kendall rank 
correlation coefficient r n (see [7]). It is defined by 



^ sign(i?i - J R i )sign(S' l - Sj). 



n(n — 1) 

Large deviations and exact slope of this statistic have been investigated by [T2] and [T3]. We refer also to 
[9] where the comparison of these statistics by their asymptotic efficiency is done. A statistic very close 
to V n was proposed by [B]. It is defined by 

L n (u n , n n> i x n n>2 ) = | v n {A x B) - fi n) i(A)fi n)2 (B) |, 

AeP^ BeQ' n 
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where P n and Q' n are finite partitions of M d and M rf respectively. Asymptotic properties of the statistic 
L n and, particular, corresponding large deviations were investigated in [6]. However, the methods applied 
in this later paper are not sufficiently strong to derive the Bahadur efficiency. Under the independence 
hypothesis, these authors proved only that for all < ei, < £2 and < 63, 

P (L n (v n , fj, n>1 x M „ >2 ) > ei + e 2 + e 3 ) < 2 m " m - e - ne i/ 2 + 2 m " e -^/2 + 2 -'n e — i/ 2 . 
A significantly more exact large deviation result is given by the following Theorem, proved in this work. 
Theorem 1.1 Under the independence hypothesis H one has 

lim n- 1 logP(y n >A) = - 5 (A) = -^(l + o(l)), A -> 0, 
where the function g is continuous for small A > 0. 
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